Relational Data Access for Business Data Analytics

نویسندگان

Veit Köppen

Andreas Lübcke

چکیده

Data, information, and knowledge are dramatically increasing (Korth & Silberschatz, 1997; Naydenova & Kaloyanova, 2010). Although, Data Warehouses are a central access for business data since the 90s, new technologies have to be considered to achieve an efficient access and processing of these multidimensional data. Recently, new architectures have evolved that try to optimize data access for certain applications. Database systems (DBS) are pervasively used for all business domains. Therefore, DBS have to manage a huge amount of different requirements for heterogeneous application domains. New data management approaches are continuously developed, e.g., new trends are NoSQL-DBMSs (Chang et al., 2006; DeCandia et al., 2007), MapReduce (Dean & Ghemawat, 2008), Cloud Computing (Armbrust et al., 2009; Foster, Zhao, Raicu, & Lu, 2009; Buyya, Yeo, & Venugopal, 2008), to make the growing amount of data manageable for new application domains. However, these approaches are developed for specific applications and need a high degree of expert knowledge. From a technical point of view, there exist different opportunities to access, process, and analyze data in a more efficient way. On the one hand, the usage of hardware, due to decreasing cost is often a suitable way. On the other hand, this requires techniques that are developed or optimized for main memory usage in data warehousing. Another possibility is to use specialized storage models. Thus, it is possible to store data on all aggregation levels in multidimensional online analytical processing (MOLAP), e.g., the data cube, or only the most interesting data, e.g., iceberg cubes. Furthermore, the data access can be enhanced by considering the architecture. Since the 70s, relational database systems use row stores, that means, tuples (rows of a table) are stored sequentially. In contrast, column stores store data in such a way that attributes (columns of a table) are stored sequentially. This enables efficiency for data access in the domain of data warehousing due to a better access for aggregations. Another challenging optimization is the selection of a suitable index structure. Multi-dimensional analyses have to be supported. Dependent on domain, data, and application scenario different index structures can enhance data processing. In this chapter, we provide an overview of architectural decision. We focus on the storage architecture for relational database systems.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Fuzzy TOPSIS Approach for Big Data Analytics Platform Selection

Big data sizes are constantly increasing. Big data analytics is where advanced analytic techniques are applied on big data sets. Analytics based on large data samples reveals and leverages business change. The popularity of big data analytics platforms, which are often available as open-source, has not remained unnoticed by big companies. Google uses MapReduce for PageRank and inverted indexes....

متن کامل

Big Data Analytics and Now-casting: A Comprehensive Model for Eventuality of Forecasting and Predictive Policies of Policy-making Institutions

The ability of now-casting and eventuality is the most crucial and vital achievement of big data analytics in the area of policy-making. To recognize the trends and to render a real image of the current condition and alarming immediate indicators, the significance and the specific positions of big data in policy-making are undeniable. Moreover, the requirement for policy-making institutions to ...

متن کامل

Hybrid: A Large-scale In-memory Image Analytics Engine

Analytical image/video processing tasks such as scene/face/activity recognition are historically performed outside most relational database management systems. Relational engines are optimized for relational data, hence naturally have weaker support for non-relational data such as images or video. Hybrid, a high-velocity in-memory analytics engine, supports advanced access capabilities to both ...

متن کامل

Hybrid Enterprise Data Lakes Provide Foundation for Disruptive Business Intelligence_161116.cdr

Enterprises are looking beyond traditional data warehousing practices to fulll their business intelligence (BI) requirements. As the need to make accurate and timely decisions increases, enterprises seek real-time access to structured and unstructured data from multiple streams and logs. A growing number of enterprises are exploring cloud and Big Data platforms to address this need. Moreover, ...

متن کامل

The Evolving Role of the Enterprise Data Warehouse in the Era of Big Data Analytics

Simple analysis of all the data trumps sophisticated analysis of some of the data. 23 Executive Summary In this white paper, we describe the rapidly evolving landscape for designing an enterprise data warehouse (EDW) to support business analytics in the era of "big data. " We describe the scope and challenges of building and evolving a very stable and successful EDW architecture to meet new bus...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2016

Relational Data Access for Business Data Analytics

نویسندگان

چکیده

منابع مشابه

A Fuzzy TOPSIS Approach for Big Data Analytics Platform Selection

Big Data Analytics and Now-casting: A Comprehensive Model for Eventuality of Forecasting and Predictive Policies of Policy-making Institutions

Hybrid: A Large-scale In-memory Image Analytics Engine

Hybrid Enterprise Data Lakes Provide Foundation for Disruptive Business Intelligence_161116.cdr

The Evolving Role of the Enterprise Data Warehouse in the Era of Big Data Analytics

عنوان ژورنال:

اشتراک گذاری